Automatic Processing Pipeline for Collecting and Annotating Air-Traffic Voice Communication Data

نویسندگان

چکیده

This document describes our pipeline for automatic processing of ATCO pilot audio communication we developed as part the ATCO2 project. So far, collected two thousand hours recordings that either preprocessed transcribers or used semi-supervised training. Both methods using data can further improve by retraining models. The proposed is a cascade many standalone components: (a) segmentation, (b) volume control, (c) signal-to-noise ratio filtering, (d) diarization, (e) ‘speech-to-text’ (ASR) module, (f) English language detection, (g) call-sign code recognition, (h) ATCO—pilot classification and (i) highlighting commands values. key component speech-to-text transcription system has to be trained with real-world ATC data; otherwise, performance poor. In order performance, apply both training contextual adaptation uses list plausible callsigns from surveillance auxiliary information. Downstream NLP/NLU tasks are important an application point view. These need accurate models operating on top real output; thus, there more too. Creating main aspiration At end project, will packaged distributed ELDA.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The MAMI query-by-voice experiment: collecting and annotating vocal queries for music information retrieval

The MIR research community requires coordinated strategies in dealing with databases for system development and experimentation. Manually annotated files can accelerate the development of accurate analysis tools for music information retrieval. This paper presents background information on an annotated database of vocal queries that is freely available on the Internet. First we outline the desi...

متن کامل

Air Traffic Management Simulation Data Visualization and Processing Tool

Large-scale distributed simulations in which pilot participants fly simulators through multiple airspace sectors managed by controller participants create a rich operational environment for investigating new air traffic management concepts. This paper describes a JavaTM-based tool that aids in integrating, visualizing, and transforming data collected from large-scale human-in-the-loop air traff...

متن کامل

SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling

This paper describes the data collection, annotation and sharing activities carried out within the FP7 EU-funded SAVAS project. The project aims to collect, share and reuse audiovisual language resources from broadcasters and subtitling companies to develop large vocabulary continuous speech recognisers in specific domains and new languages, with the purpose of solving the automated subtitling ...

متن کامل

Automatic adjustment of image processing pipeline

Mathematical processing of images in real-time vision systems can be conventionally divided into two stages: preprocessing (filtering, contrasting, protection from natural distortions, etc.) and final one (imposition, visualization, solution of the navigation task, etc.). Mentioned tasks can be solved by a lot of known and specially developed methods with various degrees of efficiency. The pres...

متن کامل

From Sequencer to Supercomputer: An Automatic Pipeline for Managing and Processing Next Generation Sequencing Data

Next Generation Sequencing is highly resource intensive. NGS Tasks related to data processing, management and analysis require high-end computing servers or even clusters. Additionally, processing NGS experiments requires suitable storage space and significant manual interaction. At The Ohio State University's Biomedical Informatics Shared Resource, we designed and implemented a scalable archit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Engineering proceedings

سال: 2021

ISSN: ['2673-4591']

DOI: https://doi.org/10.3390/engproc2021013008